COMPUTING IN 
UNIPOTENT AND REDUCTIVE ALGEBRAIC GROUPS 

ARJEH M. COHEN, SERGEI HALLER, AND SCOTT H. MURRAY 



Abstract. The unipotent groups are an important class of algebraic groups. 
We show that techniques used to compute with finitely generated nilpotent 
groups carry over to unipotent groups. We concentrate particularly on the 
maximal unipotent subgroup of a split reductive group and show how this 
improves computation in the reductive group itself. 



1. Introduction 

A linear algebraic group is unipotent if its elements are unipotent (ie, the ele- 
ments only have eigenvalue one in every representation). Unipotent groups play a 
prominent role in the theory of algebraic groups. In this paper, we present algo- 
rithms for efhcient element operations in unipotent groups. Since unipotent groups 
are nilpotent, we adapt methods used for computing in finitely generated nilpotent 
groups. A PC group provides a unique computer representation for the elements of 
nilpotent groups (see, for example, [HEO05] ). Collection gives efficient algorithms 
for multiplication and inversion of group elements. We modify these concepts to 
work with a large class of unipotent groups defined over a field. This class contains 
all unipotent groups if the field has characteristic zero. It also contains the full 
unipotent subgroup of every split reductive group. 

Algorithms for element operations in split reductive algebraic groups are given 
in [CMT04 ]. Computations in the unipotent subgroup make the largest single 
contribution to the time taken by these algorithms. This was the main impetus for 
the current paper, in which we prove: 

Theorem 1.1. Let ¥ be a field with effective algorithms for the basic element 
operations. Let G be a split reductive algebraic group over ¥ with rank n. Then 
there is a normal form for elements of G(F). The word problem for elements in 
normal form requires O(n^) field operations, and multiplying or inverting them 
requires 0(n^) field operations. 

This theorem is a great improvement over the analysis of [CMT04] , where we proved 
that the operations are polynomial time, but did not compute the exponent. This 
result is optimal in the sense that the timings are asymptotically the same as the 
straightforward methods for matrices. 

In Section \2\ we construct FC group schemes, which give normal forms for ele- 
ments of unipotent groups. We adapt two basic collection strategies in Section OS 
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We have developed two new algorithms for the full unipotent subgroup of a split 
reductive algebraic group: The first is a new collection strategy called collection 
from the outside (Section 3}. The second is a direct method for computing prod- 
ucts and inverses in classical groups using standard representations (Section E}- 
Section [6] gives the asymptotic analysis and the proof of Theorem l.lj Section \7] 
compares practical timings for the various methods considered and describes the 
default method used in Magma [ BCP97] from version 2.13. 

2. Unipotent groups and presentations 

Throughout this paper, F is a field and E is a commutative unital algebra over 
F. We assume that we have effective algorithms for the basic element operations in 
F and E. We find it convenient to use the scheme-theoretic definition of algebraic 
groups. So an algebraic group defined over F is a functor from the category of 
all commutative unital algebras over F to the category of groups, satisfying the 
appropriate additional conditions [DG70 i fWatTQ] . If G is an algebraic group, then 
G(E) is an abstract group, called the rational point group of G over E. For example, 
the additive group Ga has rational point group Ga(E) — E+. 

We define an ¥ -unipotent group to be an algebraic group defined over F with a 
normal series in which every quotient is isomorphic to Ga. This is analogous to the 
definition of an F-soluble algebraic group in [ DG70] . 

Proposition 2.1. Over a perfect field ¥, every connected unipotent group is F- 
unipotent. Over a field F of characteristic zero, every unipotent group is ¥ -unipotent. 

Proof. The first statement follows from [Ros57l Proposition 5, Corollary 2]. The 
second follows from the first using the standard fact that all unipotent groups are 
connected in characteristic zero. □ 



The group a.p, defined in [Wat 79] over fields of characteristic p > 0, is unipotent 
but not F- unipotent. 

Let U be an F-unipotent group. Fix a central series 

[/ = C/i > C/2 > ••• > Un+1 = 1, 

such that each Ur/Ur+i is F-isomorphic to Ga. The projection Ur — > Ur/Ur+i = Ga 
splits as an F-morphism of schemes by |Spr98[ Theorem 16.2.6]. Fix splitting maps 
Xr ■ Ga — > Ur- Clearly U is parametrised by iV-dimensional affine space: 

U, (ai, . . . ,aAr) ^ Xi{ai) ■ ■ •XAr(aAr). 

Multiplication and inversion in C/(E) are given by polynomials. To be precise 

N N N 

(1) J]^ Xr{ar) W Xr{br) = Xr{Fr{ai, . . . , flAT , 61 , . . . , ^Af)), 

r— 1 r— 1 r— 1 

CN \~^N 
Y\_Xr{ar)\ = Y\_Xr{Gr{ai, . . . ,aN)), 
r=l / r=l 

where all products are written in ascending order, each Fr is a polynomial in 2N 
indeterminates, and each Gr is a polynomial in TV indeterminates. The and Gr 
are called Hall polynomials [Hal69] . These polynomials have coefhcients in F, but 
tend to be very large and unwieldy. In order to do practical computations in [/, we 
need a more concise description. 
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We now construct a presentation for the rational point group C/(E). Since the 
series U{E) = Ui{E) > C/2(E) > • • • > Un{E) > Un+i{E) = 1 is central, we have 

Xr{a)xr{b) e Xr(a + 6) [/,■+! (E), 

Xria)~^ e Xr{ — a)Ur+l{E), 

Xs{b)xr{a) e Xr{a)xs{b)Us+i{'E), 
for a, 6 e E and 1 < ?- < s < A^. Hence we have relations 

N 

(3) Xr{a)xr{b) = Xr{a + b) Y\_ xt{frt{a,b)), 

N 

(4) Xr{a)~^ ^ Xr{-a) Y[ xt{grt{a)), 

t=r+l 

N 

(5) Xs{b)xr{a) = Xr{a)xs{b) Xt{hrst{a,b)), 

where frt , Qrt , and hrst sue polynomials defined over F. We note that many of these 
relations are redundant (including (0| for all r), but the extra relations are useful 
for computation. 

Theorem 2.2. Let U be an ¥ -unipotent group and let E &e a commutative F- 
algebra. Let t/(E) be the group with generators Xr{a), for a G E, r — 1, . . . , N , and 
relations [3\), (4^, and (5\), for a, 6 G E, 1 < r < s < N . Then the natural map 
[/(E) [/(E) is an isomorphism of abstract groups. 

Proof. Since the given relations hold in [/(E), the map is well defined. The map 
is onto because [/(E) is generated by the images of the generators of [/(E). Every 
element of [/(E) is a word with terms of the form Xr{a) or Xr{a)~^. This word 
can be collected into a product Xr{ar). This is achieved by first eliminating 

all inverses using (4}, then putting the terms in order by the subscripts using 
(3) and removing multiple terms with the same subscript using (3}. If the words 
Yir-i ^riflr) and rir=i Xr{br) are equal in [/(E), then Or — br for all r, and so these 
words are also equal in [/(E). Hence the map is injective and we are done. □ 

We say that the group scheme U is presented by U. 

Now suppose we are given an arbitrary system of F-polynomials frt{o,,b) and 
grt{a, b), for 1 < r < t < N; and hrstia), for l<r<s<t<N. Define the group 
functor U by taking [/(E) to be the abstract group given by generators Xr{a), for 
a S E, r = 1, . . . , iV, and relations f3j, (S}, and (3). We call U an FC group functor 
over F. FC stands for field-commutator, since the relations involve field operations 
and commutators, just as the PC presentation of a nilpotent group involves powers 
and commutators. We call U consistent if the map 

E^ U{E), (fli, . . . , ajv) 1-^ Xi{ai) • • ■ XAr(aAr) 

is injective for every commutative F-algebra E. Theorem |2.2 implies that every 
F-unipotent group is presented by a consistent FC group functor. We now prove 
the converse: 

Theorem 2.3. Every consistent FC group functor defined over F is an ¥ -unipotent 
group. 
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Proof. Let U be the consistent FC group functor. Ignoring the multiphcation, we 
can consider U to be the TV-dimensional afhne scheme. Using collection, as in the 
proof of Theorem 2.21 we can find polynomials Fr and Gr such that equations ({T]) 
and are satisfied in [/(E). So U is an F-algebraic group scheme, since Fr and Gr 
are clearly defined over F. Finally define algebraic subgroups Ur — Ylk=r^^(^r)- 
These give a normal series for U in which every quotient is isomorphic to Ga, and 
so U is an F-unipotent group. □ 

Let U be an F-unipotent group. Suppose the projection Ur Ga splits as a 
homomorphism of F-group schemes, not just as a morphism of F-schemes. Then 
we can take Xr ■ Ga — > Ur to be a homomorphism, and so replace fSj and ([IJ by 

(6) Xr{a)xr{b) = Xr{a + b). 
It follows immediately that 

(7) Xr{ay^ = Xr{-a), 

and so all the polynomials frt and grt are zero. If homomorphism for 

r = 1, . . . ,N, we call the corresponding FC group functor split. 

Theorem 2.4. // F is a field of characteristic zero, then every unipotent group 
defined over F is presented by a split FC group functor over F. 

Proof. Since F has characteristic zero, every unipotent group U defined over F 
is F-unipotent by Proposition 12. Ij By induction, we can assume that U /U n is 
presented by a split FC group functor. Fix maps ?/,■ : Ga — > U/Un, for r = 
l,...,N-l, defining this functor. By [Ser88i Proposition VII.8], Ext(Ga, Ga) = 0. 
Hence Ext(J7/[/Ar, Ga) = 0, by repeated application of the long exact sequence 
for Hom(o,Ga). So there exists a homomorphism y : U/Un U splitting the 
projection U — > U/Un- Define Xr — y ° yr for r = 1, . . . , iV — I. Take xn to be 
the F-injection Ga = Un — > U . These maps clearly define a split FC group functor 
presenting U . □ 

If F has positive characteristic, then the Witt-vector groups }Ser88] provide exam- 
ples of F-unipotent groups which cannot be presented by split FC group functors. 
The full unipotent subgroup of a split reductive group is always presented by a split 
FC group functor, as we show in Proposition 14.1 below. 

3. Collection and symbolic collection 

We now extend some of the standard collection strategies for PC groups to FC 
group functors. The precise order in which the relations are applied has a huge 
impact on the speed of collection. Many strategies have been suggested, and we 
have not attempted to extend them all to FC group functors. We have implemented 
two fundamental techniques: collection from the left [LGS90i IVL90] ; and a slightly 
improved version of collection to the left {HJ76] (we collect the rightmost rather 
than the leftmost occurrence of the least uncollected letter). 

Let U be an FC group functor over F, and let E be a commutative F-algebra. The 
algorithms in this section operate on a word w S C/(E). This word is always equal 
to ni=i [o-iY^ , ie, the parameters M e N, e E, Si = ±1, and G {!,..., N} 
are automatically modified when w is. Algorithm Q] describes the basic step of 
collection. When we say apply a certain relation to a subword", we mean match 
the subword with the left hand side of the relation, and replace it by the right 
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hand side. CollectSubword looks at the term at position j in the word w, and 
either removes an inverse (if ej = —1) or ensures that rj^i < rj. In addition to the 
modified word w, it returns indices ji and Cohection to the left (Algorithm [2| 

CollectSubword := function([/, w = Ilfli ^r, {aiY' , j) 
if Ej = —1 then 

apply (4J to the subword x^^. (oj)^-^ 

ii := j, J2 := j 
else if j > 1 and j j 1 = then 

apply (3) to the subword a;^^ (0^-1)0;,.^ (cj) 

ji ■= j - 1, J2 := Ji + #{i : frjt{ao-i,aj) ^ 0} 
else if j > 1 and 1 > then 

apply (E) to the subword Xrj_i{aj-i)xrj(aj) 

ji := j - 1 

if ji > 1 and r^^-i < rj-^ then 
i2 := .71 

else 

i2 Ji + 1 + #{i : /ir,rj_it(aj-i, ^ 0} 
end if 

else 

j; 32 ■■= 31 + 1 

end if 

return w, ji, j2 

Algorithm 1: Collect subword 

works by collecting all terms xi{a), followed by all terms X2{a), and so on. This 
uses the index ji, which gives the new largest j such that rj = r. Collection from 

Input: An FC group functor U and a word w = YifLi XniciiY' ■ 
Output: A product Y[^=i Xr{br) that is equal to w as an element of t/(E). 
for r := 1 to A^ do 

let j be the largest i such that n = r 
while j > r do 

w,ji,j2 CollectSubword(C/, w, j), j ■= ji 
end while 
end for 
return w 

Algorithm 2: Collection to the left 

the left (Algorithm goes through the word from left to right, correcting each 
term that is out of position. This uses the index j2, which gives the next term 
which is potentially out of position. 

Symbolic collection is a standard method for improving the efficiency of ele- 
ment multiplication in a PC group. This depends on the observation that we can 
collect a generic product, and then substitute into polynomials for subsequent col- 
lections. This is particularly easy in the case of FC group functors: simply take 
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Input: An FC group functor U and a word w — Y[i=i ^riia-iY' ■ 

Output: A product H^i ^r{br) that is equal to w as an element of C/(E). 

while j < M do 

w,jij2 := CollectSubword(C/,'u;, j), j := 
end while 
return w 

Algorithm 3: Collection from the left 
E — ¥[ai,a2, ■ ■ ■ ,aN,b] and do N collections in U{E) to get relations 
(8) I Yl Xs{as)]xr{b) ^ Xr{b)i Yl 

^ s=r+l ^ \s=r+l ^ 

where c^s is a polynomial in b and a^+i, . . . , aN ■ Now, taking arbitrary E again, 
we can multiply two collected words Hil^i ^A^-i) and Hil^i ^i{bi) by substituting 
values from E into the (N —l)N/2 polynomials Crs in the obvious manner. A similar 
method can be used to compute inverses. 

The advantage of symbolic collection is that each operation is faster. The dis- 
advantage is that more preprocessing time and memory are required. In order to 
save memory, we represent our polynomials as straight-line programs (see |LG01] 
for a description of straight-line programs for group elements; the implementation 
for polynomials is due to Allan Steel). This means that the polynomials are basi- 
cally just the collection preserved in amber, so the collection method used is still 
important. We note that there is another common symbolic collection algorithm, 
called Deep Thought }LGS98[ |Mer97] , but we have not implemented it for unipotent 
groups. 



4. Collection in the full unipotent subgroup 

We now describe a new collection strategy for the full unipotent subgroup of 
a reductive group. Let G be an F-split reductive algebraic group [ Spr98] . Fix a 
split maximal torus T in G, and a Borel subgroup B containing T. Let U be the 
unipotent radical of B. Since U is unique up to G-conjugacy, we refer to U as the 
full unipotent subgroup of G. Let be the root system of G with respect to T, and 
let $+ C $ be the positive roots with respect to B. 

Write = {ai, 012, . . . , a^} with the roots in an order compatible with height, 
ie, ht(ar) < ht(as) implies r < s. For each a G $+, there is a subgroup Xa of U 
isomorphic to Ga. Write Xr for the isomorphism Ga — > ATq,^. Then Ur = Y\f=r -^asj 
for r — l,...,iV + l, defines a central series for U. The corresponding FC group 
functor has relations (6} and 

(9) Xs{b)xr(a) ^ Xr(a)xs{b) Xt{G^ja^a,a''b^), 

at —iar+jas , 
t,j>0 

where the constants Cija^os are defined as in [Car72j . Recall that these constants 
depend on the combinatorics of $, and on the choice of a sign for each nonsimple 
positive root. In [CMT04] , a method for computing these constants is given which 
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is efficient for small rank groups. For large ranks, we outline a new method in 
Section \5\ We now have: 

Proposition 4.1. The full unipotent subgroup of a split reductive group is presented 
by a split FC group functor. 

Note that most of the polynomials hrst from (E) of Section [2] are zero in this 
case. This gives us much greater flexibility in how we collect words. The ordering 
of the roots is of vital importance in this section. We specify an ordering in terms 
of subscripts: ai, . . . ,aN. The subscripts on the injections Xr '■ Ga — > U are always 
kept in agreement with the root ordering under discussion. 

Words in U need not be collected into an order compatible with height. In fact, 
the algorithms of the previous section work for all orderings ai,a2, ■ ■ ■ ,ctN of 
with the property that ar + as = at implies t > r and t > s. We call such an 
ordering left-additive. 

An ordering ai,a2, . . . ,aN of is called additive if ar + as = at implies t 
lies between r and s (ie, r < t < s or s < t < r). Additive orderings reflect more 
of the combinatorial structure of the root system than left-additive orderings do. 
The existence and construction of additive orderings is considered in [Pap94] (see 
Section [6] below for more details). 

In order to collect a word into the additive ordering, we replace relation {QJ with 

(10) Xs{b)xr{a) ^ Xria)( Yl XtiCjia^a^a'i-by)) Xs{b), 

i,i>0 

which can be proved by applying to Xr{a)xs{—b). We then use collection from 
the outside (Algorithm 3| , which is a modified version of collection from the left. 
The basic idea is to run collection from both sides simultaneously, until we meet 
in the middle. The two collections could easily be run in parallel, but we alternate 
between them. The subroutine CollectSubword from Algorithm [3] is slightly 
modified for both collections (CollectSubwordL and CollectSubwordR). 
The returned value L is the increase in the word length and k is the index of 
the next term potentially out of position. 

Finally we note that symbolic collection works with collection from the outside, 
with the obvious minor modifications. 

5. The full unipotent subgroup of a classical group 

In this section, we present an alternative to collection which is much more ef- 
ficient when G has large semisimple rank. We derive formulas for the defining 
polynomials of the full unipotent subgroup J7 of a split classical group G. This al- 
lows us to write code that implicitly applies the defining polynomials of U without 
having to store them explicitly in memory. We do not consider exceptional groups 
here, because their semisimple rank is at most 8. 

The rough outline of our method is as follows: We index the roots by pairs of 
integers. We then construct a minimal degree matrix representation of G. We take 
the basis for the representation consisting of weight vectors, ordered according to 
the dominance ordering on the corresponding weights [Hum78j . We then order the 
roots by going down each column of this matrix representation, and seeing where 
the parameters corresponding to each root appear. We call this the representation 
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CollectSubwordL := function(J7, w = Hi^i (0*)'^% j) 
if £j — —1 then 

apply |7J to the subword Xrj{aj)^^ 

k := j, L:=0 
else if j > and j j 1 rj then 

apply (EJ) to the subword a;^^ (aj-i)^^^ (flj) 

k :=]-!, L:=-l 
else if j > and rj_i > Vj then 

apply ( [10]) to the subword Xr^_-i {aj_i)xrj {aj) 

k := j — 1, L := #{i : at = ka^ + la^^i for fc, Z > 0} 

else 

k:^j + l, L:=0 
end if 

return w, A:, L 

CollectSubwordR := function(;7, w = Hfli ^^^(aO^Sj) 
if £j = —1 then 

apply {7J to the subword Xrj{aj)^^ 

k := J, L:=0 
else if j < M and = r^+i then 

apply (EJ to the subword Xrj{aj)xrj{aj+i) 

k:=j, L:=-l 
else if j < M and rj > rj+i then 

apply ( JTOj ) to the subword (aj)a;rj_|_i (aj+i) 

L := ^{t : at = fca^ ^1 + lar for fc, Z > 0}, k := j + I + L 

else 

fc:=j-l, L:==0 
end if 

return w, fc, L 

Input: An FC group functor U and a word w = YifLi XniaiY'- 

Output: A product Y[T=i Xr{br) that is equal to the input as an element of U(E). 

i 1, J := M 
while i < j do 

w, fc, L := CollectSubwordL(C/, w, i), i := fc, M :— M + L, j := j + L 
ii i < j then 

■u;, fc, L := COLLECTSuBWORDR(t/, j := k, M := M + L 

end if 
end while 
return w 

Algorithm 4: Collection from the outside 
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ordering. Note that the representation ordering is also an additive ordering, except 
in Cartan type C^, where it gives an additive ordering on the coroots. 

The representation of roots by pairs of integers can also be used to compute 
the constants Cija^p^ ■ For large classical groups, this is much more efficient than 
the method given in [CMT04] . We omit the details since this is a technical but 
straightforward application of formulas in [Car72) . 

We consider each classical type in Subsections 15.1 to 15.41 In each subsection, 
we fix a particular isogeny type and particular extraspecial signs [Spr98] . The 
choice of isogeny class is irrelevant, since the structure of the unipotent subgroup 
is independent of isogeny. We can easily transform between different choices of 
extraspecial signs using Theorem 29 of [Ste68] . 

5.1. Cartan type A^: Linear of degree £+1. Let G — SL/4.1 and let U be the 

algebraic subgroup of all lower unitriangular matrices. The root system of G has 
Cartan type Ag. Let V = K^+^ with basis ei, . . . , e^+i. Then the roots are 



for i, j = 1, ...,£+ 1 with i ^ j. The simple roots are a^^i+i for i = 1, . . . , i. A root 
aij is positive if, and only if, i < j. Roots add by the formulas a^- + ajm ~ ctim, 
and aij + akm = unless j ~ k 01 i — m. 

Define the map Xij : Ga — » G by Xij (a) = / + aEji . The representation or- 
der on the roots is simply the lexicographic order on the corresponding pairs of 
integers. We label the coordinates of A^^"*"^^^/^ by the root pairs in this order, ie, 
a G A^'^+i^^/^^E) Yia.s the form 

= (ai2, ai3j ■ ■ ■ : OlJ+l, 023, • ■ ■ , 02, £+1, • ■ ■ , Of/+l)- 

We can now define a parametrisation ip : A^^+^^^Z^ ^ U hy 



i+i e+i 

'/'(a) — n n 



1 

012 

013 



1 

O23 



\ 0,14+1 O2/+1 



:,£+ 



1 1/ 



We now get tp{a)(p{b) = (f{c) where 



7 ^ bikOkj 

i<k<j 



Also f(a) 



(f{d) where 



i<k<j 



The formulas for inversion are defined recursively and are computed in reverse 
representation order. We note that it is easy to derive a direct formula for dij, but 
the recursive version can be evaluated with fewer operations. 
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5.2. Cartan type B^: Orthogonal of degree 2t + 1. Let Fm be the m x m 
matrix over F of the form 

/O ... 1\ 
... 1 

\l ... 0/ 

We assume, for this subsection only, that F has odd characteristic. Since the group 
of type is isomorphic to the group of type in characteristic 2, this restriction 
is not critical. Let G = S02^+i be the special orthogonal group of the orthogonal 
form with matrix 

FA 
2 . 

Fe 0/ 

Let U be the group of all lower unitriangular matrices in G. The root system of G 
has Cartan type B^. Let V = with basis ei, . . . , e^. The roots are 

Oisi,tj = SBi - tcj and asi,o = sci, 

for i,j = !,...,£ with i ^ j and s,t = ±1. For the sake of readability, we write 

I instead of — ?' in subscripts, eg, a2.-3 is denoted a23- Note that = aji for all 
i,j = ±1, . . . , ±£. The simple roots are aj,i+i, for i = 1, 1, and a„o- A root 

ai^tj for i,j > is positive if, and only if, i < j; a root 0:^1,0 is positive if, and only 
if,'s = +1. 

Define the root maps by 

Xij{a) — I + a{Eji — E2£-i+2,2i-j+2), 

Xi-j{a) = I + a{E2e-j+2,i — E2t-i+2,j), 

XiQ{a) = / + a{2Ee+i,i - Ei^(+i) - a'^E2e-i+2,i, and 

Xioia) = I + a{E2i-i+2,t+i — 2^^+!, 2^-1+2) — a^Ei^2e-i+2- 

Let Ji be the sequence of integers [i + l,i + 2, . . .,£,0, . . . , — (i + 2), — + Let 
J[ := Ji \ {0}. The representation order on the positive roots is the lexicographic 
order on pairs, with the integers ordered as in Jq. Label the coordinates of A by 
the root pairs in this order. Now define a parametrisation (p : U hy 

i 

<P(a) -=1111 ) 
i=i jeJi 

/ 1 \ 

ai2 1 







1 






2aio • 




2a go 


1 




a-U ■ 








1 



• •■ •■ '^^-i.o '^e-i,e 

ai2 02 •■■ : : : "■• 1 

\ a" ••• "-'u "-'w o-'u ■■■ «'i2 1 / 



COMPUTING IN UNIPOTENT GROUPS 



11 



where 

i<k<e i<k<i 

a'ij = —dij — aika'i^j, 

i<k<j 

a'ij = —ttij - ^ aikd'kj — o-ijO-'j — 2a,oajo - ^ aik^jk- 

i<k<j keJ'j 

Now (p{a)ip{b) = (p{c) where 

CiO = fliO + biO + bikCtkO, Cij = O-ij + bij + bikClkj, 

i<k<l i<k<j 

Cij = ttij + bij+ ^ bikttkj + a'jbij + 26ioajo + ^ bika'^j,. 

i<k<j keJ'j 

And ip{a)~^ = <f{d) where 

diO = ~0'io — ^ ^ dikdkOi dij = —dij — ^ ^ dikO-kji 

i<k<i i<k<j 

dij = —ciij — dikdkj — a'jdij — 2dioa'jQ — dika'jj,. 

i<k<j keJ'j 

All of these equations are recursive and are computed in reverse representation 
order. 

5.3. Cartan type C: Symplectic. Let G = Sp2e be the symplectic group of the 
symplectic form with matrix 

Fe 
-Fe 

Let U be the group of all lower unitriangular matrices in G. The root system of G 
has Cartan type C^. Let V = M.^ with basis ei, . . . , e^. The roots are 

^si,tj — SCi tCj , 

for i,j = with i ^ j, and s,t = ±1. Once again = aji for all i,j = 

0, ±1, . . . , ±£. The simple roots are a,,i+i for i = 1, . . . ,£ — 1, and a^^. A root Q!i,y 
is positive if, and only if, i < j. 
Define the root maps by 

Xij{a) — I + a{Eji — i?2£-i+i,2£-j+i), 
Xij{a) = I + a{E2t-j+i,i + E2t-i+\,j), and 
XiT,{a) = I + aE2t-i+\,i- 

The representation order on the positive roots is the lexicographic order on pairs, 
with the integers ordered as in Jq. Label the coordinates of by the root pairs 
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in this order. Now define a parametrisation ip : 

( 1 

an 1 

au . . . ai-\ i 1 



ai2 «2 
/ _ 

12 



V a'/ a! 



1 

ai2 1 / 



where 



i<j<e i<k<j 

i<fe<j fceJj 
Now (p{a)ip{b) = (p{c) where 

Cij = ttij + bij + X] bikttkj, Cii = a'- +b" + X] (^ik^ik + X ^i^a^^, 

i<k<j i<k<e keJ- 

Cij = ttij + bij + X^ bikUkj + a' -bij + bika'jj^. 

i<k<j keJ'j 

And (p{a)~^ = (p(d) where 

dij = -a^j — X] dikUkj, dii = -a'l + X] c'jfe^ife ~ Xl ^^''^'ik' 



i<k<j 



i<k<t 



keJl 



dij — Uij ^ ^ dikdkj djdij ^ ^ dik^jk' 
i<k<j keJ'. 



5.4. Cartan type D: Even-degree orthogonal. Let G = S02£ be the orthogo- 
nal group of the orthogonal form with matrix F2i. Let U be the group of all lower 
unitriangular matrices in G. The root system of G has Cartan type D^. Let V = M.^ 
with basis ei, . . . , e^. The roots are 

for i,j = with i ^ j and s,t = ±1. The simple roots are ai.j+i, for 

i = 1, and a^_i i- A root ai^tj for i, j > is positive if, and only if, i < j. 

Define the root maps by 

Xij{a) = I + a{Eji - E2i-i+i,2i-j+i) and 
Xij{a) = I + a(£'2^-j+i,i — E2t-i+\,j). 
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The representation order on the positive roots is the lexicographic order on pairs, 
with the integers ordered as in Jq. Label the coordinates of A^^^"^^/^ by the root 
pairs in this order. Now define a parametrisation ip : A^(^~^)/^ U hy 



(p{a) 



i=i jeJ(' 

( 1 

ai2 1 



aw 



ai2 a2 



\ a'{ a 



12 



where 



i<k<j 



keJ' 



a'i2 1 / 



i<j<i i<k<j 



Now (p{a)(f{b) = ip{c) where 

i<k<j 

Cij = ttij + bij + ^ bikttkj + a'-bij + ^ bika'jj,. 

i<k<j 



keJ'. 



And <^(a) ^ = y(d) where 



i<k<j 



i<k<j keJ'. 



6. Analysis and reductive groups 

We can now give precise asymptotic timings for operations in reductive groups 
and their full unipotent subgroups. We give our analysis in terms of the number of 
basic operations in the algebra E: addition, negation, multiplication, and testing 
equality. Once again let G be an F-split reductive algebraic group, with split 
maximal torus T, and Borel subgroup B containing T. Let U be the unipotent 
radical of B. Let W = Ng{T)/T be the Weyl group and let $ be the root system. 
The reflection in W corresponding to the root a is denoted Sa- Let be the 
positive roots with respect to B. 

First we give an analysis for element operations in U{E). 
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Theorem 6.1. Let ¥ be a field and let E be a commutative unital ¥-algebra. Let 
U be the full unipotent subgroup of a split reductive linear algebraic group G over 
F. Let £ be the semisimple rank of G. Then there is a normal form for elements 
of U{¥,). The word problem for elements in normal form requires 0{P) algebra 
operations, and multiplying or inverting them requires 0{l'^) algebra operations. 

Proof. The normal form is a collected word, so the timing for the word problem 
follows from the fact that N = |<i>+| is 0{£^). We can assume that G is simple, 
since [/ is a direct sum of the full unipotent subgroups of the simple components 
of G. 

If G is classical, the formulas of Section require 0{£^) field operations. 

If G is exceptional, then £ is bounded. In this case we use symbolic collection. 
The Hall polynomials of the full unipotent subgroup of a split reductive group are 
independent of the algebra E, since split reductive groups can be constructed as 
Z-schemes. So the number of algebra operations required for inversion or multipli- 
cation is independent of the choice of E. □ 

In the rest of this section, we take E to be an extension field of F and we 
add inversion to the list of basic operations in E. We are primarily interested in 
computing in ?7(E) because it allows us to compute in G(E) with the algorithms of 
[CMT04i Section 5]. Recall that G(E) has a Steinberg presentation with generators 
Xa{a), for a G <& and a G E; n^, for a G <&; and t G T(E). Note that the generator 
Xr{a) of Section can be identified with the generator Xa^{a) of the Steinberg 
presentation. Every element g G G(E) can be written uniquely in Bruhat form: 

g = utwu' , 

for 

• M G ?7(E) stored as a collected word; 

• t G T(E) stored as in [CMT04 ]; 

• w = ■ ■ ■ , where Sa-^ ■ ■ ■ Sa^ is a reduced expression for w G W] and 

• u' G Uw{K) as a collected word, where Uw is the subgroup of U generated 
by the terms Xa{a), for a in := {a G \ aw~^ ^ $+}. 

Given two elements in Bruhat form, we need to find the Bruhat form of their 
product. The usual element operations in [/(E) are not sufficient for this purpose. 
There are two difficult steps, each of which requires a new operation in U. We now 
describe these operations and show how to carry them out with the methods of the 
previous sections. 

6.1. Single-term separation. One difficult step is multiplying g = utwu' by 
Ua for some a G $. This is achieved with Algorithm 3 of [CMT04] . which uses 
the following operation: write u' = Jl/3e* ^pW-li) i^i the form Xa{aa)v where 
V = ri/Se* \{q} •'^isi^p)- '^^11 t^i^ operation single-term separation. 

This is easily done by collection: simply collect the term Xa{aa) to the front 
of the product as in collection to the left, then put v in the required form with 
collection from the outside. No extra terms of the form Xa{b) can appear in v 
because only terms corresponding to roots higher than a are created. We can also 
do single term separation symbolically as in Section [31 

Alternatively, for classical groups, we can compute v as the product Xai—aa)u' 
using the formulas of Section [51 If a = aij , then the only possible nonzero constants 
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in (^(a) areoij, a^^, and a". Hence at most O(i') of the formulas for Cy are nontrivial. 
Each such formula has at most a constant number of nonzero terms. We now have: 

Proposition 6.2. Single-term separation in U(E) requires 0{£) field operations. 

Proof. Use formulas for classical components and symbolic collection for exceptional 
components. □ 

Note that, when both of the elements being multiplied are in Bruhat form, Al- 
gorithm 3 of [CMT04] only uses single-term separation for a simple. We have 
considered nonsimple roots as well, because they will be useful in the next subsec- 
tion. 

6.2. Weyl separation. The other difficult step for multiplication in G is comput- 
ing the product of g £ G and v G U. Write g in Bruhat form utwu' . Then multiply 
u' and w, and decompose the product into the form v"v' where 

v" = Xa{ba) and = Xa{ha). 

We call this operation Weyl separation. We now get the Bruhat form 

gv = 

where (w")"" ^* ^ is in C/ since a G \ $tu implies aw~^ is positive. 

If we take the elements of \ in an order compatible with height, followed 
by the elements of in an order compatible with height, we get a left-additive 
ordering on . So the algorithms of Section ^ can also be used for separation. 
But note that {v"Y" * will need to be collected again, since the image of the left 
additive ordering on $+ \ under w^^ need not be left additive. 

We can also use collection from the outside for Weyl separation. We need the 
following classification of additive ordcrings from [Pap94] : 

Theorem 6.3. Let w be an element of the Weyl group W . Let S/jj • • • sp^ he a 
reduced expression for w. Then 

is an additive ordering on All additive orderings on $^ arise from reduced 

expressions in this manner. 

Now let Wo be the longest word in W and fix a reduced expression Sai ■ ■ ■ for '"'o 
(in practice, we use the lexicographically least reduced expression, but this is not 
necessary). We use the additive ordering on corresponding to this reduced 
expression as the fixed order for collection. Now let w be a Weyl group element. If 
we restrict the fixed ordering to we get an additive ordering, with corresponding 
reduced expression s^^ . . . s^,„ = w. Similarly we restrict to get an ordering on 
^uioui-i = \ ^■w)w~^ and a corresponding reduced expression Sj-^ . . . s-y^^^ = 
WoW~^. Now Wo = S71 • ■ ■ S7jv_,„s/3i ■ ■ • S/3,„ is reduced. The corresponding ordering 
is: our fixed ordering restricted to \ and transformed by w, followed by 
our fixed ordering restricted to This is precisely the ordering we need for 

separation. 

Finally we analyse Weyl separation: 

Proposition 6.4. Weyl separation in U requires 0{i^) field operations. 
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Proof. For classical components, we apply single-term separation for each root in 
By Proposition 16.21 this takes 0{N£) — 0{£^) operations. For exceptional 
components use symbolic collection. □ 

In the exceptional case, this proposition assumes we have a system of symbolic- 
collection polynomials for every Weyl element. Although this is polynomial time, 
the memory required to store all these polynomials is prohibitive. In practice, it is 
much faster to use collection from the outside for Weyl separation in exceptional 
groups. 

6.3. Operations in reductive groups. We now prove the following result on 
computation in G: 

Theorem 6.5. Let ¥ be a field and let E be an extension of F. Let G be a split 
reductive linear algebraic group over the field F. Let i be the semisimple rank of G 
and let n be the reductive rank. Then there is a normal form for elements o/G(E). 
The word problem for elements in normal form requires 0{n + i'^) field operations, 
and multiplying or inverting them requires 0{nl^) field operations. 

Proof. We use the Bruhat decomposition to store g G G in the normal form g — 
uhwu'. Here u, u', and w are words of length at most N, while h has length 
n. Once again the timing for the word problem is clear. Now G is a central 
product of simple algebraic groups and a central torus of dimension at most n. 
Multiplying a toral element by an element in a simple component is done as in 
Subsection 5.5 of [CMT04] . and takes time 0{n£'^). So it suffices to show that 
multiplication and inversion in a simple group G requires 0(£^) operations. The 
algorithms for multiplication and inversion given in [CMT04] require a constant 
number of multiplications or inversions in ?7(E), together with a constant number 
of Weyl separations and at most 0{£^) single-term separations. The theorem now 
follows from Theorem 16.11 Proposition 16. 21 and Proposition 16.41 □ 

Theorem 1.1 is an immediate consequence of this result and the fact that £ < n. 
7. Implementation and timings 

A number of heuristic improvements are built into our implementations of the 
algorithms described. Most of them are either obvious or were suggested by our 
profiling of the code. We restrict ourselves here to a brief description of the basic 
data types used. Representations of elements of the field F or algebra E are taken 
care of by the Magma computer algebra system [BCP97) . Most of our code is 
written in traditional C |KR88J and incorporated into the Magma core. Less time- 
critical code is written in the Magma language itself. 

A collected product n!^i^»-(ar) is stored as a sequence [oi, . . . , oat]. While 
doing the collection, we represent a term Xr{a) as a pair (r, a) of an integer and 
an element of E. Note that pairs (r, 0) are trivial - they are always eliminated as 
soon as they occur. A word Y[i=i Xri{o,iY^ is represented as a doubly linked chain. 
That is, every root element in the chain contains a reference to its predecessor 
and successor, which is a null-reference if the element is the first (resp. last) in the 
chain: 

We use this data structure because inserting and deleting terms in the word when 
applying relations ©-(EI) can be done in constant time. For sequences, insertion 
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and deletion would be more expensive, since the tail of the sequence has to be 
copied in memory. The chain is doubly linked, since we need both the predecessor 
and the successor of a term in the word for the CollectSubword functions. 
In our tables we use the following abbreviations for collection algorithms: 

CTL: Collection to the left. Section SI 

CFL: Collection from the left. Sectional 

CFO: Collection from the outside. Section 

SCFL: Symbolic collection from the left, Sectional 

SCFO: Symbolic collection from the outside, Section 

We have two different implementations of the method of Section [5J 

D: Modified matrix multiplication. For given a, 6, we use formulas of Section[5] 
to compute ip{a) and the significant part of if{b) (we do not use b[j and, 
except in types B (even characteristic) and C, we do not use 6"). Then the 
product of the two matrices is computed using algorithms implemented in 
the Magma computer algebra system [BCP97] . The resulting matrix agrees 
with ip{c) in the entries and in the entries c" (where they are needed). 
Thus we can recover the product c = ab from the resulting matrix by 
formulas of Section \5\ 

SD: Compute the polynomials of fS) in Section [SJ using the formulas instead 
of collection. 

Method D outperforms SD in most cases, since asymptotically fast algorithms are 
used for matrix multiplication. But SD is faster for fields with very rapid blow-up 
of terms, such as multivariate rational function fields. All timings were run on an 
AMD Opteron 150 Processor with 2393 MHz. 

Table Q] gives times and memory consumption for creating the reductive groups 
and precomputing all constants. For symbolic algorithms, this also includes time 
taken to compute the polynomials. Note that all constants and polynomials are 
independent of the field, and are computed on a per-root-datum basis. This means 
that preprocessing time is nearly zero if a group with the same root datum has 
already been created in the same Magma session. We used a workspace of 4 gi- 
gabytes - when this is insufficient we do not give a time and write > AGB in the 
memory column. In columns D and SD, the constants are computed as they are 
needed and not stored in memory. 

Table [2] gives average times for multiplying and inverting random elements of 
full unipotent groups over the field with 17 elements. The average is taken over 100 
multiplications. The same random elements are used for different algorithms. If a 
single multiplication required more than 2 gigabytes of memory, we write > 2GB 
instead of a time. We did not attempt those cases where the preprocessing took 
more than 4 gigabytes of memory. 

Table [3] gives similar times for multiplying and inverting random elements of 
the reductive group itself. Each such operation involves a number of collections. 
Computing random elements in a reductive group can be time consuming, but this 
is not included in our timings. 

Table gives average times for multiplying and inverting random elements of full 
unipotent groups of reductive groups over different fields. Over the field of rational 
numbers, the random field elements are chosen by taking a random numerator and 
a random denominator of size up to the given number of bits and a random sign. 
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1.147 


1.080 


1.546 


1.112 


1.982 


2.031 


A3o(17) 


120.524 


10.798 


7.887 


21.833 


8.936 


29.141 


29.752 


104.102 


13.639 


12.655 


21.838 


13.484 


21.959 


22.627 


Aioo(17) 






























Bio(17) 


0.423 


0.325 


0.304 


0.453 


0.414 


1.353 


1.359 


0.561 


0.495 


0.486 


0.592 


0.570 


0.860 


0.867 


B2o(17) 


342.015 


9.964 


5.184 


28.542 


7.234 


27.707 


29.383 


237.331 


10.345 


8.721 


24.834 


10.331 


18.471 


19.728 


B3o(17) 


> 2GB 


170.359 


34.042 


560.664 


228.163 


190.445 


198.912 


> 2GB 


92.252 


51.170 


397.654 


197.548 


140.195 


146.980 


Bioo(17) 






























Cio(17) 


0.430 


0.320 


0.314 


0.455 


0.334 


1.420 


1.426 


0.556 


0.488 


0.483 


0.588 


0.502 


0.893 


0.905 


C2o(17) 


347.298 


10.341 


5.380 


28.422 


14.166 


27.950 


29.116 


225.542 


10.033 


8.517 


24.409 


15.189 


18.399 


19.387 


C3o(17) 


> 2GB 


174.191 


33.764 


580.168 


116.204 


181.660 


190.782 


> 2GB 


99.784 


54.166 


414.583 


115.765 


138.295 


145.607 


Cioo(17) 






























Dio(17) 


0.344 


0.280 


0.268 


0.317 


0.278 


1.170 


1.184 


0.454 


0.412 


0.409 


0.441 


0.416 


0.727 


0.734 


D2o(17) 


225.229 


8.183 


4.606 


24.562 


10.825 


23.712 


24.692 


166.790 


8.538 


7.340 


21.308 


12.015 


15.674 


16.571 


D3o(17) 


> 2GB 


147.254 


31.131 


521.051 


90.824 


174.995 


185.490 


> 2GB 


82.122 


47.233 


373.211 


94.738 


135.237 


142.438 


Dioo(17) 






























G2(17) 


0.008 


0.008 


0.006 


0.008 


0.008 






0.006 


0.006 


0.006 


0.006 


0.006 






F4(17) 


0.080 


0.076 


0.075 


0.077 


0.074 






0.060 


0.060 


0.059 


0.059 


0.060 






E6(17) 


0.172 


0.166 


0.158 


0.165 


0.158 






0.128 


0.129 


0.124 


0.127 


0.126 






£7(17) 


0.774 


0.508 


0.451 


0.518 


0.455 






0.419 


0.369 


0.363 


0.381 


0.366 






Eg (17) 


52.481 


3.098 


1.566 


3.815 


1.624 






8.781 


1.396 


1.288 


1.974 


1.353 







S3 
H 

Pi 
< 



Table 3. Average time to multiply random elements of the reductive group 
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Group 




Multiply 






Invert 












CFO 


SCFO 


D 


SD 


CFO 


SCFO 


D 


SD 


A2o(2) 




0.021 


0.030 


0.020 


0.031 


0.005 


0.017 


0.003 


0.015 


A2o(17) 




0.029 


0.031 


0.020 


0.032 


0.012 


0.018 


0.003 


0.017 


A20(Q), 


32 bits 


0.045 


0.038 


0.024 


0.038 


0.051 


0.109 


0.213 


0.135 


A20(Q), 


64 bits 


0.049 


0.041 


0.028 


0.041 


0.086 


0.195 


0.736 


0.265 


A20(Q), 


128 bits 


0.056 


0.047 


0.039 


0.047 


0.177 


0.394 


2.877 


0.611 


A2o(Q(i)). 32 bits 


0.074 


0.052 


0.047 


0.049 


0.080 


0.143 


0.048 


0.117 


A20(Q(P) 


), 32 bits 


0.108 


0.069 


0.062 


0.062 


0.122 


0.265 


0.068 


0.214 


A20(fl) 




0.049 


0.038 


0.025 


0.038 


6.868 


2.562 


0.594 


1.609 


B2o(2) 




0.056 


0.589 


0.047 


0.303 


0.023 


0.590 


0.015 


0.282 


B2o(17) 




0.174 


0.592 


0.049 


0.309 


0.125 


0.596 


0.017 


0.289 


B20(Q), 


32 bits 


0.420 


0.811 


0.254 


1.901 


0.986 


2.630 


2.625 


5.169 


B20(Q), 


64 bits 


0.534 


0.960 


0.630 


3.594 


2.205 


5.111 


11.937 


12.567 


B20(Q), 


128 bits 


0.798 


1.269 


1.865 


8.058 


5.575 


11.475 


50.818 


34.318 


B2o(Q(j)). 32 bits 


0.680 


0.972 


1.078 


2.124 


1.270 


2.654 


1.344 


3.874 


B20(Q(p) 


), 32 bits 


1.092 


1.362 


1.410 


3.864 


2.276 


4.542 


2.053 


7.212 


B20(fl) 




0.589 


0.777 


> 2GB 


31.884 


> 2GB 


> 2GB 


> 2GB 


> 2GB 


E8(2) 




0.012 


0.028 






0.003 


0.022 






E8(17) 




0.016 


0.029 






0.007 


0.023 






EsCQ), 


32 bits 


0.047 


0.125 






0.061 


0.283 






E8(Q), 


64 bits 


0.075 


0.206 






0.118 


0.526 






E8(Q), 


128 bits 


0.141 


0.390 






0.262 


1.099 






E8(Q(i)), 


32 bits 


0.077 


0.162 






0.094 


0.310 






E8(Q(p)) 


32 bits 


0.113 


0.291 






0.146 


0.621 






E8(-R.) 




0.315 


0.349 






0.904 


1.166 







Table 4. Operations for random elements of the full unipotcnt 
group over different fields 





CFL 


CFO 




max 


avg 


max 


lim avg 


A, 
Be 
Ce 
Df 


£ 

2i - 1 
2£ ~ 1 
21-3 


(^ + 2)/3 
(21+1- i)/3 
(2i+^- 57)/3 
(21 - l)/3 


2 
4 
3 
3 


2 
4 
3 
3 



Table 5 . Total degrees of Hall polynomials 



Similar random elements were used for the Gaussian integers Q(i) and for Q{p), 
which is the splitting field of a random irreducible polynomial of degree 6 with 
integral coefficients in the range 1 to 10. The field R is the multivariate rational 
function field over Q with 10 variables. Random field elements over R were taken 
to be random invariates. In B2o(i?) the coefficient blowup is so large that over 
2 gigabytes of memory was needed in some cases (see entries in the table). 

Finally, in Table \5\ we compare the total degrees of the polynomials used for dif- 
ferent kinds of symbolic collection. The last column contains limf^oo avg{deg(p) : 
p e P}, where V is the set of polynomials used for symbolic collection. This goes a 
long way towards explaining why collection from the outside works so well. Since 
the polynomials are multivariate in 2N = 2|$+| variables, we still can have large 
polynomials. To have a very rough idea of the size, we printed the largest of the 
polynomials in type B15 as a string and measured its size in bytes. Using collection 
from the outside, the size is 9226 bytes; using collection from the left, the size is 
about 297 megabytes. 
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